Draft syllabus for a 10-hour graduate short course in Probability and Statistics for Earth and Planetary Scientists
P.B. Stark.
Last edited 19 April 2015
Lecture 1: Probability
- Counting and combinatorics
- Sets: unions, intersections, partitions
- De Morgan's Laws
- The Inclusion-Exclusion principle
- The Fundamental Rule of Counting
- Combinations
- Permutations
- Strategies for counting
- Theories of probability
- Equally likely outcomes
- Frequency Theory
- Subjective Theory
- Shortcomings of the theories
- Rates versus probabilities
- Measurement error
- Where does probability come from in physical problems?
- Making sense of geophysical probabilities
- Earthquake probabilities
- Probability of magnetic reversals
- Probability that Earth is more than 5B years old
Lecture 2: Probability, continued
- Axiomatic Probability
- Outcome space and events, events as sets
- Kolmogorov's axioms (finite and countable)
- Analogies between probability and area or mass
- Consequences of the axioms
- Probabilities of unions and intersections
- Bounds on probabilities
- Bonferroni's inequality
- The inclusion-exclusion rule for probabilities
- Conditional probability
- The Multiplication Rule
- Independence
- Bayes Rule
- Random variables.
- Probability distributions of real-valued random variables
- Cumulative distribution functions
- Discrete random variables
- Probability mass functions
- The uniform distribution on a finite set
- Bernoulli random variables
- Random variables derived from the Bernoulli
- Binomial random variables
- Geometric
- Negative binomial
- Hypergeometric random variables
- Poisson random variables: countably infinite outcome spaces
Lecture 3: Random variables, contd.
- Random variables, continued
- Continuous and "mixed" random variables
- Probability densities
- The uniform distribution on an interval
- The Gaussian distribution
- The CDF of discrete, continuous, and mixed distributions
- Distribution of measurement errors
- The box model for random error
- Systematic and stochastic error
- Independence of random variables
- Events derived from random variables
- Definitions of independence
- Independence and "informativeness"
- Examples of independent and dependent random variables
- IID random variables
- Exchangeability of random variables
- Marginal distributions
- Point processes
- Poisson processes
- Homogeneous and inhomogeneous Poisson processes
- Spatially heterogeneous, temporally homogenous Poisson processes as a model for seismicity
- The conditional distribution of Poisson processes given N
- Marked point processes
- Inter-arrival times and inter-arrival distributions
- Branching processes
Lecture 4: Expectation and Simulation
- Expectation
- The Law of Large Numbers
- The Expected Value
- Expected value of a discrete univariate distribution
- Special cases: Bernoulli, Binomial, Geometric, Hypergeometric, Poisson
- Expected value of a continuous univariate distribution
- Special cases: uniform, exponential, normal
- Expected value of a multivariate distribution
- Standard Error and Variance.
- Discrete examples
- Continuous examples
- The square-root law
- Standardization and Studentization
- The Central Limit Theorem
- The tail-sum formula for the expected value
- Conditional expectation
- The conditional expectation is a random variable
- The expectation of the conditional expectation is the unconditional expectation
- Useful probability inequalities
- Markov's Inequality
- Chebychev's Inequality
- Hoeffding's Inequality
- Jensen's inequality
- Simulation
- Pseudo-random number generation
- Importance of the PRNG. Period, DIEHARD
- Assumptions
- Uncertainties
- Sampling distributions
Lecture 5: Testing
- Hypothesis tests
- Null and alternative hypotheses, "omnibus" hypotheses
- Type I and Type II errors
- Significance level and power
- Approximate, exact, and conservative tests
- Families of tests
- P-values
- Estimating P-values by simulation
- Test statistics
- Selecting a test statistic
- The null distribution of a test statistic
- One-sided and two-sided tests
- Null hypotheses involving actual, hypothetical, and counterfactual randomness
- Multiplicity
- Per-comparison error rate (PCER)
- Familywise error rate (FWER)
- The False Discovery Rate (FDR)
Lecture 6: Tests and Confidence sets
- Tests, continued
- Parametric and nonparametric tests
- The Kolmogorov-Smirnov test and the MDKW inequality
- Example: Testing for uniformity
- Conditional test for Poisson behavior
- Permutation and randomization tests
- Invariances of distributions
- Exchangeability
- The permutation distribution of test statistics
- Approximating permutation distributions by simulation
- The two-sample problem
- Testing when there are nuisance parameters
- Confidence sets
- Definition
- Interpretation
- Duality between hypothesis tests and confidence sets
- Tests and confidence sets for Binomial p
- Pivoting
- Confidence sets for a normal mean
- known variance
- unknown variance; Student's t distribution
- Approximate confidence intervals using the normal approximation
- Empirical coverage
- Failures
- Nonparametric confidence bounds for the mean of a nonnegative population
- Multiplicity
- Simultaneous coverage
- Selective coverage